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New recombinant herpes viruses, a vaccine based on these 
recombinants, their preparation process, genes, vectors and 
plasmids used in this process. 


Abstract 

The invention concerns the sequence of the unique short 
region Us of Marek's disease virus coding for the kinase protein. 
It also concerns the recombinant herpes viruses in which a hetero¬ 
logous gene has been inserted in the homologous region of the gene 
coding for the kinase protein and, in particular, the Marek recom¬ 
binant viruses expressing a gene of an avian pathogenic agent 
which could possibly be a gene of another serotype of the Marek 
viruses. It also concerns a process for the preparation of these 
recombinants as well as the vaccines obtained. 
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a vaccine based on these 


process, genes, vectors and 


The present invention concerns recombinant herpes 
viruses, in particular, for producing vaccines, their process of 
preparation and the plasmids produced during this process. More¬ 
over, it concerns a part of the chromosome of Marek's disease 
virus (MDV) which can be used for preparing such vaccines. 

Different types of viruses have been used as expression 
10 vectors of foreign genes, in particular of genes coding for anti¬ 
genic proteins, and have proven their potential for immunizing 
animals* The vaccinia virus has, to a great extent, been used for 
constructing recombinant viruses. The herpes viruses have also 
been used: the herpes simplex virus (HSV) (M. Shih et al. f Proc. 
Natl. Acad. Sci., USA. 1984, 81, 5867-5870), the varicella virus 
(VZV) (R. Lowe et al., Proc. Natl. Acad, Sci., USA. 1987, 84, 
3896-3900). The foreign gene is inserted into a fragment of the 
genomic DNA of the herpes virus, corresponding to a non-essential 
region for the viral replication, cloned in a plasmid. This gene 
20 is transferred into the viral genome by homologous recombination. 
This latter step is carried out by cotransfection of the herpes 
genomic DNA and plasmid since this genomic DNA is by nature infec- 
tious. 

Different genes of herpes viruses have been identified 
as non-essential to viral growth, certain of these genes being 
associated with virulence. 
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- The gene of the thymidine kinase of the herpes simplex 
virus (D. Dubbs et al., Virology, 1965, 22, 493-502), of 
the Aujeszky virus (G. Tatarov, Zentralbl. Vet. Med., 

1968, 15 B, 848853), of the rhinotracheal infectious bovine 
virus (S. Kit et al., Virology, 1983, 130, 381-389). 

- The gene gill of the Aujeszky virus (A, Robins et al., J. 
Virol., 1986, 59, 635-645). 

- The gene gX of the Aujeszky virus (D. Thomsen et al., 0. 
Virol., 1987, 61, 229-232). 

- The gene gl of the Aujeszky virus (C. Mettenleiter et al ., 
J. Virol., 1987, 61, 4030-4032). 

Viruses in which one or other of these genes has been 
deleted nonetheless retain the capacity to produce a latent infec¬ 
tion in mice. 

Studies pertaining to the unique short region of the 
genome of the herpes simplex virus HSV-1 have been conducted (B. 
Megnier et al., Virology, 1988, 162, 251-254) and have shown that 
the viruses HSV-1 in the short region of which a gene has been 
deleted, have undergone an attenuation. 

For their part, F.C. Purves et al. (Journal of Virology, 
1987, vol. 61, No. 9, 2896-2901) have demonstrated that the open 
reading frame US3 of the short fragment of the HSV-1 virus genome 
codes for a virus enzyme, the kinase protein, and is not essential 
to the replication of said virus. 

Studies have been undertaken on Marek's disease virus 
which belongs to the subfamily of gamma herpes viruses. This is 
an enveloped virus having a double stranded linear genomic DNA of 
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about 175 kilobases. Its genome is composed of a long segment 
(UL) and of a short segment (Us) fram d by r peated inversed ter¬ 
minal sequences* 

Marek's disease virus causes paralysis and a lympho- 
proliferative disease in chickens, usually, at the age of 2 to 5 
months. This disease results in very significant economic losses 
(L. Payne, Biology of Marek's Disease Virus and the Herpes Virus 
of Turkeys, in The Herpes Virus, vol. 1, pp* 347-431, edited by B. 
Roizman, plenum Press). 

The strains of Marek's disease virus have been 
classified into three serotypest 

- serotype 1 comprises the pathogenic strains and attenuated 
strains derived therefrom. 

- serotype 2 comprises the naturally attenuated strains. 

- serotype 3 comprises the herpes virus of turkeys (HVT) and 
its variants. 

Consequently, the term Marek's attenuated disease virus 
will designate serotypes 1, 2 and 3 at the same time. 

Marek's disease virus (MDV) and herpes virus of turkeys 
(HVT) have similar genomic arrangements (A. Buckmaster et al., J. 
Gen. Virol., 1988, 69, 2033-2042) and numerous homologies of 
sequence all along their genome (C. Gibbs et al., Proc. Acad. 

Natl. Sci. USA, 1984, 81, 3365-3369). 

The chicks are vaccinated at the age of one day and are 
then protected against Marek's disease for their entire life. For 
numerous years, vaccination with the herpes virus of live turkeys 
(HVT) has been very effective for controlling the disease. 
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Nevertheless. the emergence of new viral strains which 
are highly virulent has led to the use of strains of attenuated 
Marek's disease viruses of another serotype, to vaccinate and thus 
increase the level of protection, either, for example, the strain 
CVI 988, MDV serotype 1 attenuated by passing over cells (B. 
Rispens et al., Avian Dis., 1972, 16, 1108-125), or, for example, 
the association of the HVT MDV serotype 3 and SB1 strains, MDV 
serotype 2 (K. Schat et al., J. Natl. Cancer Inst., 1987, 60, 
1075-1082). 


The genome of Marek's virus has certain similarities 
with the genome of alpha viruses, herpes simplex (HSV) and 
chickenpox (VZV), Most of the genes localized in the long region 
LJL of the genome are approximately col inear between the herpes 
simplex virus, chickenpox (D. McGeogh, J. Gen. Virol., 1988, 69, 
1531-1574) and Marek's disease virus (A. Buckmaster, 1988). Thus, 
in the international patent application WO 90/02802, it was pro¬ 
posed that the genes be inserted into this UL region of HVT and 
MDV. 


On the other hand, the localization of genes in the Us 
segment shows a larger divergence between herpes viruses. Also 
among the dozen open reading frames identified for the herpes 
simplex virus, only four have a homology with the chickenpox virus 
(McGeogh, 1988, cited above). 

In fact, the prior art does not suggest that there is an 
interest in proceeding with a homology study of open reading 
frames of the unique short region of the genome of Marek's disease 
virus with the HSV-1 genes. 
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The present invention divulges, for the first time, the 
sequence coding for the kinase protein of the genome of Marek's 
disease virus (MDV) and allows one to establish, in a surprising 
manner, that this gene can be deleted without blocking the viral 
replication and allowing the insertion of heterologous sequences 
opening the way for development of a series of viral expression 
vectors* 


This type of a recombinant and attenuated virus of 
Marek’s disease constitutes a choice candidate for developing a 
viral vector expressing foreign genes to be used for polyvalent 
vaccination of poultry since it has the advantage of being able to 
be used for its own vaccinal properties and as a vaccine against 
other viral, bacterial and parasitic diseases as, for example, 
infectious avian bronchitis, Newcastle's disease, fowl plague, 
egg-drop syndrome, Gumboro's disease, chicken anaemia, coccidio- 
sis, fowl pox, infectious laryngotracheitis, avian colibacillos, 
is pasteurellosis, haemophilosis. 


SUMMARY OF THE INVENTION 


The object of the invention is to provide recombinant 
20 herpes viruses, including a recombinant attenuated Marek's disease 
virus (serotypes 1, 2 or 3), recombinant which can be used as a 
vaccine, the method for constructing such a recombinant virus as 
well as a vector virus allowing the multivalent vaccination 
against viral, bacterial or parasitic avian diseases. 

One object of the invention is the nucleotide sequence 
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and its variants corresponding to the US3 gene which is homologous 
to the kinase protein gene of the herpes simplex virus and the 
surrounding regions. The term variants of the nucleotide se¬ 
quence, as commonly used, means any equivalent sequence such as 

i' 

obtained, for example, by degeneration of the code, minor modifi¬ 
cations, mutations or corresponding to viral variants. 

Another object of the invention is a recombinant virus 
selected from the herpes viruses, including the viruses of 
pseudorabies disease, infectious bovine rhinotracheitis, equine 
10 rhinopneumonitis, feline rhinotracheitis, canine herpes. 

Another object of the invention is also a Marek’s 
disease recombinant virus comprising one or more heterologous 
genes inserted in the region of its genome corresponding to US3 
gene in such a way so as to be expressed. 

By "heterologous gene", one means, in particular, a gene 
coding for a protein or an immunogenic glycoprotein of a viral, 
bacterial or parasitic pathogenic agent, in particular, an agent 
associated with an avian pathology. This also relates to the 
construction of hybrid viruses, for example, by introducing, into 
20 the genome of a turkey herpes virus, genes coding for immunogenes 
of a Marek's disease virus of serotype 1 and/or serotype 2. 

"Heterologous gene" is also intended to mean a gene 
coding for a peptide or a protein, for example, hormone, growth 
factor, immunomodulator• 

The heterologous gene is preferably expressed under the 
control of regulating sequences of US3 gene transcription. One 
can, however, see to it that this expression is either controlled 
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by a promoting sequence coming from another gene of the virus in 
question, for example, the promoter of TK gene, of gA gene, of gB gene 
or from another herpes virus, for example, the promoter of gl gene of 
the infectious bovine rhinotracheitis virus or from gene II or gene 
III of pseudorabies virus* 

Preferably, the start and stop codons of US3 gene are 
substituted by those of the gene to be expressed* 


DESCRIPTION OF THE FIGURES 


Figure 1 shows the construction of the plasmid pMDV 53L 
10 which contains the lacZ gene inserted instead of US3 gene. 

Figure 2 shows the construction of the plasmid pMDV 53 CL 
which contains the lacZ gene under the control of the promoter iE of 
the human cytomegalovirus instead of US3 gene. 

Figure 3 shows the plasmid pMDV 53F which contains the gene 
of the fusion protein of Newcastle's disease virus instead of U53 
gene. 

DESCRIPTION OF THE INVENTION: MATERIALS AND METHODS 

Viral Strain 

The serotype 1 strain RB1B of Marek's disease virus (MDV) 

20 was used (Schat K.A. et al., 1982, Avian Pathol. 11, 593-605). 

The virus culture methods and methods for extraction of 
viral DNA having a high molecular weight have been described (C. Lee 
et al., 1980, J* Gen. Virol., 51, 235-253? N. Ross et al., 1989, 70, 
1789-1804). 
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Cell Culture 

The fibroblasts of chicken embryo (CEP) were cultivated 
in 199 F10, medium supplemented by penicillin, streptomycin, 
fungizone and fetal calf serum, as described (N. Ross, 1975, J. 
Gen. Virol., 28, 37-47) , 

Cloning of Viral DNA 

Generally, the techniques used for the construction of 
recombinant plasmids are those described by T. Maniatis et al. 

(T. Maniatis et al. , 1982, Molecular Cloning: A Laboratory 
Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor. NY). 

For all cloning and subcloning steps, the vector 
linearized by the appropriate restriction enzymes is 
dephosphorized before ligation. The purification of the DNA 
fragments starting from an agarose gel is done according to the 
technique described by the manufacturer: "Geneclean" (Bio 101, 
San Diego, California, USA). 

Sequencing 

The cloned fragments are sequenced according to the 
classic technique described by Sanger (G. Sanger, S. Nicklen, A. 
Coulson, 1977, Proc. Natl. Acad., USA 74, 5463-5467). The 
sequences after translation have been compared to the published 
sequences of herpes simplex virus and chickenpox (McGeogh, 1985, 

J. Mol. Biol. 181, 1, 13; K. Davison et al., 1986, J. Gen. Virol. 
67, 1759-1816). 

Controlled Mutagenesis 

The DNA fragments, subcloned in the Blue Script vector 
(Stratagene, La Jolla, California, USA) are mutagenesized after 



separation of the DNAs simple fragment with the help of the R408 
helper phage (Stratagene, La Jolla, California, USA) (M. Russel, 

S. Kidd. M. Kelley, 1986, Gene 45, 333-338). 

The mutagenesis procedure and selection of the mutants 
by using the strain CJ 236 dut-, ung- of E. coli (In Vitrogen, San 
Diego, California, USA) is described by T. Kunkel, (T. Kunkel 
1985, Proc. Natl. Acad. Sci., 82, 488-492 and T. Kunkel et al., 

I 

1987, Methods of Enzymology 154, 367-382, Acad. Press). 

In vivo Recombination 

The recombinant viruses are obtained according to the 
conventional techniques of transfection of the sensitive cells, 
such as the calcium phosphate method or the one using the Lipo- 
fectine reactive described by the manufacturer BRL (P.L. Feigner 
et al,, 1987. Proc. Natl. Acad. Sci., USA, 84, 7413). For this, 
the chicken embryo fibroblasts, cultivated to confluence, are 
cotransfected with the genomic DNA and the plasmid carrier of the 
DNA fragment to be inserted, flanked in 5' and 3' by the genome 
sequences which allows the recombination. 

The recombinant viruses can then be screened by hybridi¬ 
zation with an appropriate probe or by plaque colouring. When a 
gene marker, the lacz gene of 0-galactosidase, is inserted into 
the gene of Marek's disease virus, the expression of this gene can 
be followed by adding, to the cell covering, an agarose overcoat 
enclosing the chromogenic substrate for the fl-galactosidase, e.g. 
Xgal (5-bromo-4-chloro-3-indolyl, B.D. 


galactopyranoside). 
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Example 1: Isolating an EcoRI fragment of 5.25 kilobases. 


The viral genomic DNA was digested by the restriction enzyme EcoRI 
and the fragments cloned in the vector pUC 13 (Pharmacia) 
(Yannisch, Perron et al., 1985, Gene 33, 103-119). 

Among the cloned fragments, a fragment of 5.25 kilo- 
bases, localized at the level of the small fragment Us, has been, 

more particularly, analyzed by sequencing (pMDV 05; sequence ID 
no. 1). 

The sequence comprises six open reading frames (ORF). 

10 The translated sequences of 4 of these ORF have a homology with 
the type I HSV virus proteins, localized in the Us fragment, in 
particular, the US3 gene of Marek's disease virus has a homology 
with the gene of the kinase protein of the herpes simplex virus. 

Surprisingly, the study of this region has shown that 
the US3 gene can be deleted without blocking the viral replica¬ 
tion. 


Example 2t Construction of a plasmid pMDV 53L for which the US3 
gene has been replaced by the lacZ gene (Figure 1). 

The EcoRI fragment of 5.25 kilobases stemming from the 
clone pMDV 05 was digested by the Ncol enzyme and the extremities 
thus generated restored by the DNA polymerase Klenow fragment. It 
was then digested by the Kpnl enzyme and the fragment of 1989 
pairs of bases thus liberated was cloned in the Blue Script vector 
linearized by the enzymes EcoRV and Kpnl to give the plasmid pMDV 
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52 having 4947 pairs of bases. 

The Ncoi and Sail sites were respectively introduced to 
the extremities 5* and 3’ of the cloned fragment by controlled 
mutagenesis using the oligonucleotides designated by Seq ID no. 2 
and 3, in the list of attached sequences, which generates the pMDV 

53 plasmid. The lacZ gene was purified from the pMC 1871 plasmid 
(Pharmacia LKB, Uppsala, Sweden) (S.K. Shamira et al. 1983, Gene 
25, 71-82) by digestion by the enzymes Sinai and Sail. 

It was then inserted into the pMDV 53 plasmid, partially 
digested by the Ncoi enzyme, treated by the DNA polymerase Klenow 
and then controlled by the enzyme Sail, which generates 
the plasmide pMDV 53L of about 6687 pairs of bases. 

Example 3: Preparation of a Marek’s disease virus comprising the 
lacZ gene* 


The chicken embryo fibroblasts were cotransfected with 
the total chromosomal DNA of the virus and the DNA of the 
linearized plasmid pMDV 53L (10 to 50 fu 3 ) . The cultures were 
observed for 4 to 6 days until infectious centres appeared. 
Alternatively, the cells were trypsinated after 72 hours, then 
reinoculated (1:1 or 1:2) in a secondary passage, until lysis 
segments were obtained. 

The medium was then replaced by the new medium 
comprising 1% agarose and 0.5% Xgal. 

The plaques which are due to the recombination viruses 
are distinguished by their blue colour. 
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The viruses can thus be purified by plaque purification 
and, after inoculation with healthy cells, give cytopathogenic 
effect shapes coloured in blue in the presence of Xgal. 


Example 4: Construction of the plasmid pMDV 53 CL which comprises 
the lacZ gene under the control of the immediate early promoter of 
the human cytomegalovirus (CMV) (Figure 2). 

The lacZ gene of j3*galactosidase was placed under the 
control of the immediate early promoter of the human 
cytomegalovirus (IECMV) in the vector pCMV-lacZ. 

10 The fragment of about 4500 pairs of bases comprising the 

whole, the promoter IE of the cytomegalovirus and the lacZ gene, 
were digested by the EcoRI enzyme and the extremities filled by 
the DNA polymerase Klenow fragment. it was then digested by the 
Sail enzyme. 

The fragment thus liberated was cloned in the partially 
digested pMDV 53 vector, by the Ncol enzyme, treated by the DNA 
polymerase Klenow fragment and then digested by the Sail enzyme. 
This plasmid composed of about 8200 pairs of bases is called pMDV 
53 CL. 


20 Example 5i Construction of a Marek recombinant virus for which 

the lacZ gene was introduced under control of the immediate early 
promoter of the cytomegalovirus, instead of the US3 gene. 

The CEP were cotransfected with the genomic DNA of the 
virus and from 10 to 50 fig of DNA of the linearized plasmid pMDV 


53 CL. 
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One can thus obtain the recombinant viruses which are 
distinguished by the appearance of blue-coloured infectious 
plaques in the presence of the chromogen substrate Xgal. These 
viruses were purified according to the plaque purification 
technique and allowed to infect the secondary chicken embryo 
fibroblasts. 

The blue-coloured infectious plaques can be obtained in 
the presence of Xgal which shows that the lacZ gene is inserted 
at the locus of the US3 gene. 

a 

10 Example 6: Construction of the pMDV 53F plasmid which comprises 

the gene of the fusion protein of Newcastle's disease virus 
instead of the US3 gene (Figure 3). 

The fusion gene (j. Taylor et al., 1990, J. Virol., 

64, 1441-1450) was introduced in the form of a fragment at the 
blunt ends in the Blue Script vector at the Sinai site to give the 
pNFl plasmid having 5300 pairs of bases. 

Ncol and Sail sites were introduced by controlled 
mutagenesis at the level of the ATG and stop codons of the fusion 
gene, due to the oligonucleotides indexed SEQ ID no. 4 and SEQ ID 
20 no. 5 respectively in the list of the attached sequences. 

The fragment of 1682 pairs of bases Ncol/Sall coining 
from the pNF2 plasmid was inserted into the pMDV 53 vector 
partially digested by the Ncol enzyme and digested by the Sail 
enzyme to give the pMDV 53F plasmid having 5369 bases pairs. 
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Example 7t Construction of a Marek's disease virus comprising the 
fusion protein gene of Newcastle’s disease virus. 

The chicken embryo fibroblasts were cotransfected with 
the total genomic DNA of the virus and 10 to g of linearized 

DNA of the pMDV 53F plasmid. The cultures were observed for the 
appearance of infectious plaques. 

The recombinant viruses were then screened by 
hybridization with a probe including the fusion gene. 

Similar procedures were used for the construction of the 
non-avian recombinant herpes viruses, by inserting a heterologous 
gene in the Us region and, in particular, in the homologous gene 
at US3 gene of Marek's virus* 

The invention also concerns vaccines, live or not, made 
up of or containing recombinant viruses constructed according to 
the invention, or containing immunogenes expressed by these 


viruses. 
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APPENDIX I 


LIST O F SEQUENCES 


SEQ ID no. 1 
Length of sequence: 

Type of molecule sequenced: 
Origin of the molecule: 

Experimental source: 
Characteristics: 


5.255 pairs of bases 
genomic DNA 
Marek's disease virus, 
strain RBI B 
pMDV 05 plasmid 


from 1 to 324 pairs of bases: non-coding region 
from 325 to 1,135 pairs of bases: US1 gene. The gene is 

coded by the complementary DNA segment at 
the indicated sequence and is transcribed 
from right to left (SEQ ID no. l.B) 


from 

623 

to 

1,214 

pairs 

of 

bases: 

US2 gene 


from 

1,215 

to 

1,245 

pairs 

of 

bases: 

non-coding 

region 

from 

1,246 

to 

2,451 

pairs 

of 

bases: 

US3 gene 

from 

2,452 

to 

2,563 

pairs 

of 

bases: 

non-coding 

region 

from 

2,564 

to 

3,004 

pairs 

of 

bases: 

US4 gene 

from 

3,005 

to 

3,190 

pairs 

of 

bases: 

non-coding 

region 

from 

3,191 

to 

4,384 

pairs 

of 

bases: 

US5 gene 

from 

4,385 

to 

4,494 

pairs 

of 

bases; 

non-coding 

region 

from 

4,495 

to 

5,253 

pairs 

of 

bases: 

US6 gene 



CAAAAATTTACArTAGTAATCTTTCTCCGTGGCTTACCAAATCGTCCTCTTGGTATATCCATATCATCGAAC 72 


ATTGTAGCATTGACTCTGCTCATCGTTGTCTTTCAAATGCGCTCGATTGTTGAATCTCTCCTGATGTTAGAA 144 
GTATATGGAAGATAGCCTGGATACATAAGTGATCTAGAAGGGTTTGTTArTGCAGrAATATACAAATTATAC 216 
GTGACACTATAGCGACGGTTGTAGCGATGCACCTAATCGTAATGTGTATACGCCCCATCATGTAATTATATC 288 
TAATTGGTAGCAAGTAGGTCTGTCGAATAACAGCTAATGACTACCGGCTCTACATTTTTTCTGTATTCGTGA 360 


CTTTCCTGTCGCAGTGTAACGAACCGGAATTGCAATCGCATCTCTATCTTCTTTCTTGCAACATTTTCCACA 432 

ACAGAATAATCTGCCGGGTGTACTACTCATTTGAGGTGGTTCGATTTCCGGAGGTTTTAGAGGATTGGGTGG 504* 

GGACCCGAGGATTTTGTATACACATACCATATCACTGTC6CAAAAATGCGCTCTATCTTCT6GGGTGTCGAA 576 

CTTCGGTTCCCATGTAGATGTCAAGAGAGTTTGAATATTGTCGGGA ATG GCC CAC GGC ATA CCG 640 

flat Ala His G1v lit Pro 6 


GAC CAG GTC CCA GAC ACT TTG ATT GCA AGT 

Asp Gin Val Pro Asp Thr Lau lit Ala Sar 

TCG AGO GCA ATG CGA CAT ATA TCT GCC GCC 

Str Sar Ala Dal Arg His lit Sar Ala Ala 

GCA TTA CCA GAA ACT TCA GAT TCC AAC ATC 

Ala Lau Pro Glu Thr Sar Asp $ar Asn lit 

CAT TCT GTG GAA CAT CCT GCA ACA TCT TCA 

His Sar Val Glu His Pro Ala Thr Str Sar 

CTA GTT CCG GCC AAT CCG GTA CCA CGA ACT 

Lau Val Pro Ala Asn Pro Val Pro Arg Thr 

CTT ACT ATC GGT CGA TGT TGC CGA GGA AGA 

Lau Thr I la Gly Arg Cys Cys Arg Gly Arg 

AAT AGG TCT GCA GOT CTG ACG ATT ATG GGC 

Asn Arg Sar Ala Ala Lau Thr iU hat Gly 

TCC ATA CAT TGC TTT ATA AGG AAT ATC CAT 

Sar Da His Cys Pha Da Arg Asn Da His 

CTT CCT GGC AAT CGA TCG CAT TCA TCT AGA 

Lau Pro Gly Asn Arg Sar His Sar Sar Arg 

ACA CCC ATC TTC ACT CCA CCA ATA ATC TTT 

Thr Pro Da Pha Thr Pro Pro Da Da Pha 

CTG ATC TCC AAA TCT TAT ACC TCT GGT AGA 

Lau Da Sar Lys Sar Tyr Thr Sar Gly Arg 


AAC CTT TTT GGC AAA GGA ATA CAT 694 
Asn Lau Pha Gly Lys Gly Da His 24 

CCA ACT ATC CAC AAG CTA TGT GGA 748 
Pro Thr Da His Lys Lau Cys Gly 42 

AAA TAT CCA GAT AGA ACA TCC TGC SO? 

Lys Tyr Pro Asp Arg Thr Sar Cys 60 

AAT AGC CGC ACT ATA AAC 6AA TCC 856 
Asn Sar Arg Thr Da Asn Glu Sar 78 

CCA GTT CCA TCT GGT GGC TTT GTC 9i0 
Pro Val Pro Sar Gly Gly Pha Val 96 

ATT AAC ATG GGT TTG GCA AAA CGG 964 
Da Asn Mat Gly Lau Ala Lys Arg 114 

ACA CCC ACA TCA TCC TGT ATT TGT 1018 
Thr Pro Thr Sar Sar Cys Da Cys 132 

AAA GTA GAT GCA GCA TC-T CTA GAT 1072 
Lys Val Asp Ala Ala Sar Lau Asp i50 

AGT GTG ACT ATA GTT ATC ATG GAC 1126 
Sar Val Thr Da Val Da Mat Asp 168 

TTT ATT GTT AAT AAC TGG GCC GGT 1160 
Pha Da Val Asn Asn Trp Ala Gly 186 

ATA TGAAACAGGGTTAAAACTAGGTAATAG 1240 
Da 197 s 


ACTGGATG TCT TCG AGT CCG GAG GCA GAA ACG ATG GAA TGC GGC ATT TCT TCG TCG 
Hat Sar Sar Sar Pro Glu Ala Glu Thr flat Glu Cys Gly Da Sar Sar Sar 


1296 

214 


AAA GTA CAC GAC TCT AAA ACT 

Lys Val His Asp Sar Lyt Thr 

AAT GGT ACG GAT ACG ACG TTG 

A»n Gly Thr Asp Thr Thr Lau 

GAA GTG ACG GGG GAT GTG GAC 

Glu Val Thr Gly Asp Val Asp 


AAT ACT ACC TAC GGA ATT 
Asn Thr Thr Tyr Gly Da 

TTT GAT ACT TTT CCC GAC 
Pha Asp Thr Pha Pro Asp 

GAT GTG AAG ACT GAG AGC 
Asp Val Lys Thr Glu Sar 


ATA CAT AAC AGC ATC 1350 

Da His Asn Sar Da 232 

AGT ACC GAT AAC GCG 1404 

Sar Thr Asp Asn Ala 250 

TCT CCC GAG TCC CAA 1458 

Ser Pro Glu Sar Gin 268 
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TCT 

GAA 

GAT 

TTG 

TCA 

CCT 

TTT 

GGG 

AAC 

GAT 

GGA 

AAT 

GAA 

TCC 

CCC 

GAA 

ACG 

GTG 

1512 

- Ser 

Glu 

Asp 

Leu 

Ser 

Pro 

Phe 

Gly 

Asn 

Asp 

Gly 

Asn 

Gl u 

Ser 

Pro 

Gl u 

Thr 

Val 

266 

ACG 

GAC 

ATT 

GAT 

GCA 

GTT 

TCA 

GCT 

GTG 

CGA 

ATG 

CAG 

TAT 

AAC 

AAT 

GTT 

TCA 

TCG 

1566 

TKr 

Asp 

lie 

Asp 

Ate 

Val 

Ser 

A l a 

Val 

Are 

net 

Gin 

Tyr 

Asn 

Asn 

Va 1 

Ser 

Ser 

304 

TTA 

TCG 

ccc 

GGA 

TCT 

GAA 

GGG 

TAT 

ATC 

TAT 

GTT 

TGT 

ACA 

AAG 

CGT 

GGG 

GAT 

AAT 

1620 

Leu 

Ser 

Pro 

G 1 y 

Ser 

Glu 

Gly 

T yr 

He 

Tyr 

Val 

Cys 

Thr 

Lys 

Are 

Gly 

Asp 

Asn 

322 

ACC 

AAG 

AGA 

AAA 

GTC 

ATT 

GTG 

AAA 

GCT 

GTG 

ACT 

GGT 

GAC 

AAA 

ACC 

CTT 

GGG 

AGT 

1674 

Thr 

Lys 

Ar$ 

Lys 

Val 

He 

Val 

Lys 

Ala 

Val 

Thr 

Gly 

ASP 

Lys 

Thr 

Leu 

Gly 

Ser 

340 

GAA 

ATT 

GAT 

ATA 

TTA 

AAA 

AAA 

ATG 

TCT 

CAC 

CGC 

TCC 

ATA 

ATT 

AGA 

TTA 

GTT 

CAT 

1728 

Glu 

lie 

ASP 

He 

Leu 

Lys 

Lys 

net 

Ser 

His 

Are 

Ser 

He 

He 

Are 

Leu 

Val 

Hi & 

356 

GCT 

TAT 

AGA 

TGG 

AAA 

nrm 

TCG 

ACA 

GTT 

TGT 

ATG 

GTA 

ATG 

CCT 

AAA 

TAC 

AAA 

TGC 

GAC 

1782* 

Ala 

Tyr 

Ar 9 

Tr p 

Lys 

Ser 

Thr 

Val 

Cys 

net 

Val 

net 

Pro 

Lys 

Tyr 

Lys 

Cys 

Asp 

376 

TTG 

TTT 

ACG 

TAC 

ATA 

GAT 

ATC 

ATG 

GGA 

CCA 

TTG 

CCA 

cta 

AAT 

CAA 

ATA 

ATT 

ACG 

1836 

Leu 

Pht 

Thr 

T yr 

He 

Asp 

He 

net 

Gly 

Pro 

Leu 

Pro 

Leu 

Asn 

Gin 

He 

He 

Thr 

394 

ATA 

GAA 

CGG 

GGT 

TTG 

CTT 

GGA 

GCA 

TTG 

GCA 

TAT 

ATC 

CAC 

GAA 

AAG 

GGT 

ATA 

ATA 

1890 

lie 

Glu 

Ac 9 

Gly 

Leu 

Leu 

Gly 

A) a 

Leu 

Al a 

T yr 

He 

His 

G! u 

Lys 

Gly 

He 

He 

412 

CAT 

CGT 

GAT 

GTA 

AAA 

ACT 

GAA 

AAT 

ATA 

TTT 

TTG 

GAC 

AAA 

CCT 

GAA 

AAT 

GTA 

GTA 

1944 

Hi* 

Are 

Asp 

Val 

Lvs 

Thr 

Glu 

Asn 

lie 

Phe 

Leu 

Asp 

Lys 

Pro 

Glu 

Asn 

Val 

Val 

430 

TTG 

GGG 

GAC 

TTT 

GGG 

GCA 

GCA 

TGT 

AAA 

TTA 

GAT 

GAA 

CAT 

ACA 

GAT 

AAA 

CCC 

AAA 

1998 

Leu 

Gly 

asp 

Phe 

Gly 

Ala 

Ale 

Cy* 

Lys 

Leu 

Asp . 

Glu 

His 

Thr 

Asp 

Lys 

Pro 

Lys 

446 

TGT 

TAT 

GGA 

TGG 

AGT 

GGA 

ACT 

CTG 

GAA 

ACC 

AAT 

TCG 

CCT 

GAA 

CTG 

CTT 

GCA 

CTT 

2052 

Cys 

Tyr 

Gly 

Trp 

Ser 

Gly 

Thr 

Leu 

Glu 

Thr 

Asn 

Ser 

Pro 

Glu 

Leu 

Leu 

Ala 

Leu 

466 

GAT 

CCA 

TAC 

TGT 

ACA 

AAA 

ACT 

GAT 

ATA 

TGG 

AGT 

GCA 

GGA 

TTA 

GTT 

CTG 

TTT 

GAG 

2106 

Alp 

Pro 

Tyr 

Cys 

Thr 

Lys 

Thr 

Asp 

He 

Trp 

Ser 

Ala 

Gly 

Leu 

Val 

Leu 

Phe 

Glu 

484 

ATG 

TCA 

GTA 

AAA 

AAT 

ATA 

ACC 

TTT 

TTT 

GGC 

AAA 

CAA 

GTA 

AAC. 

GGC 

TCA 

GGT 

TCT 

2160 

rut 

Ser 

Val 

Lys 

Asn 

lie 

Thr 

Phe 

Phe 

Gly 

Lvs 

Gin 

Val 

Asn 

Gly 

Ser 

Gly 

Ser 

502 

CAG 

CTG 

AGA 

TCC 

ATA 

ATT 

AGA 

TGC 

CTG 

CAA 

GTC 

CAT 

CCG 

TTG 

GAA 

TTT 

CCA 

CAG 

2214 

Gin 

Leu 

Are 

Ser 

lie 

lie 

Are 

Cys 

Leu 

Gin 

Val 

His 

Pro 

Leu 

Glu 

Phe 

Pro 

Gin 

520 

AAC 

AAT 

TCT 

ACA 

AAC 

TTA 

TGC 

AAA 

CAC 

TTC 

AAG 

CAG 

TAC 

GCG 

ATT 

CAG 

TTA 

CGA 

2268 

Asn 

Asn 

Ser 

Thr 

Asn 

Leu 

Cys 

Lys 

His 

Phe 

Lys 

Gin 

T yr 

Ala 

He 

Gin 

Leu 

Are 

536 

CAT 

CCA 

TAT 

GCA 

ATC 

CCT 

CAG 

ATT 

ATA 

CGA 

AAG 

AGT 

GGT 

ATG 

ACG 

ATG 

GAT 

CTT 

2322 

His 

Pro 

Tyr. 

Ala 

lie 

Pro 

Gin 

He 

He 

Are 

Lys 

Ser 

Gly 

net 

Thr 

net 

Asp 

Leu 

556 

GAA 

TAT 

GCT 

ATT 

GCA 

AAA 

ATG 

CTC 

ACA 

TTC 

GAT 

CAG 

GAG 

TTT 

AGA 

CCA 

TCT 

GCC 

2376 

Glu 

Tyr 

Ala 

He 

Ala 

Lys 

n«t 

Leu 

Thr 

Phe 

ASP 

Gin 

Glu 

Phe 

Arg 

Pro 

Ser 

Ala 

574 

CAA 

GAT 

ATT 

TTA 

ATG 

TTG 

CCT 

CTT 

TTT 

ACT 

AAA 

GAA 

CCC 

GCT 

GAC 

GCA 

TTA 

TAC 

2430 

Gin 

Asp 

He 

Leu 

net 

Leu 

Pro 

Leu 

Phe 

Thr 

Lys 

Glu 

Pro 

Ala 

ASP 

Ala 

Leu 

Tyr 

592 

ACG 

ATA 

ACT 

GCC 

GCT 

CAT 

ATG 

TAAACACCCGTCAAAAATAACTTCAATGATTCATTTTATAATA 

2494 

Thr 

lie 

Thr 

Ala 

Ala 

His 

net 












599 

TATACTACGCGTTACCTQCAATAATGACAACATTCGAAGTCTTTGAAGATTCGCAGACCTTTTTTGCGAATG 

2566 


















net 

* 

600 

GCA 

CCT 

TCG 

GGA 

CCT 

ACG 

CCA 

TAT 

TCC 

CAC 

AGA 

CCG 

CAA 

ATA 

AAG 

CAT 

TAT 

GGA 

2620 

Ala 

Pro 

Ser 

Gly 

Pro 

Thr 

Pro 

Tyr 

Ser 

His 

Are 

Pro 

Gin 

He 

Lvs 

His 

Tyr 

Gly 

618 

ACA 

TTT 

TTG 

GAT 

TGC 

ATG 

AGA 

TAT 

ACT 

CTA 

AAC 

GAT 

GAG 

AGT 

AAG 

GTA 

GAT 

GAT 

2674 

Thr 

Phe 

Leu 

Asp 

Cys 

Het 

Are 

T yr 

Thr 

Leu 

Asn 

Asp 

Gl u 

Ser 

Lys 

Val 

Asp 

Asp 

636 



18 


^ ^ ^ f- '• 

(. ' ‘ *■' \ 


AGA TGT TCA GAC ATA CAT AAC TCC TTA GCA CAA TCC AAT GTT ACT TCA AGC ATG 2723 

Arg Cys Ser Asp lie His Asn Ser Leu Ala Gin $er Asn Val TKr Ser Sen Het 654 

TCT GTA ATG AAC GAT TCG GAA GAA TAT CCA TTA ATA AAT GGA CCT TCG ATG CAG 2782 

Ser Val Met Asn Asp Ser Glu Glu Tvr Pro Leu lie Asn Gly Pro Ser (let Gin 672 

GCA GAG GAC CCT AAA AGT GTT TTT TAT AAA GTT CGT AAG CCT GAC CGA AGT CGT 2836 

Ala Glu ASP Pro LyS Ser Val Phe T yr Lys Val Arg Lys Pro Asp Arg Ser Arg 690 

GAT TTT TCA TGG CAA AAT CTG AAC TCC CAT GGC AAT AGT GGT CTA CGT CGT GAA 2390 

Asp Phe Ser Trp Gin Asn Leu Asn Ser His Gly Asn Ser Gly Leu Arg Arg Glu 703 

AAA TAT ATA CGT TCC TCT AAG AGG CGA TGG AAG AAT CCC GAG ATA TTT AAG GTA 2944 

Lys Tyr He Arg Ser Ser Lys Arg Arg Trp Lys Asn Pro Glu lie ’Phe Lys Val 726 

TCT TTG AAA TGT GAA TCA ATT GGC GCT GGT AAC GGA ATA AAA ATT TCA TTC TCA 2993 

Ser Leu Lys Cys Glu Ser lie Gly Ala Gly Asn Gly He Lys lie Ser Phe Ser 744 

TTT TTC TAACATTATAATATATCAGATCGTT rCTTATATACTTATTTTCATCGTCGGGATATGACTAAC 3067 
Phe Phe 746 

GTATACTAAGTTACAAGAAACAACTGCTTAACGTCGAACATAACGGAAATAAAAATATATATAGCGTCTCCT 3139 

ATAANTGTTATATTGGCACCTTTTAGAGCTTCGGTATGAATAGATACAGATATG AAA GTA TTT TTT 3205 

Met Lys Val Phe Phe 751 

TTT AGA TAT ATC TCA TCC ACG AGA ATG ATT CTT ATA ATC TGT CTA CTT TTG GGA 3259 

Phe Arg Tyr He Ser Ser Thr Arg Met lie Leu lie He Cys Leu Leu Leu Gly 769 

ATT GGG GAC ATG TCC GCA ATG GGA CTT AAG AAA GAC AAT TCT CCG ATC ATT CCC 3313 

lie Gly Asp Met Ser Ala Met Gly Lau Lys Lys Asp Asn Ser Pro lie lie Pro 787 

ACA TTA CAT CCG AAA GGT AAT GAA AAC CTC CGG GCT ACT CTC AAT GAA TAC AAA 3367 

Thr Leu His Pro Lys Gly Asn Glu Asn Leu Arg Ala Thr Leu Asn Glu Tyr Lys 805 

ATC CCG TCT CCA CTG TTT GAT ACA CTT GAC AAT TCA TAT GAG ACA AAA CAC GTA 3421 

lie Pro Ser Pro Leu Phe Asp Thr Leu Asp Asn Ser Tyr Glu Thr Lys His Val 823 

ATA TAT ACG GAT AAT TGC AGT TTT GCT GTT TTG AAT CCA TTT GGC GAT CCG AAA 3475 

He Tyr Thr Asp Asn Cys Ser Phe Ala Val Leu Asn Pro Phe Gly Asp Pro Lys 841 

TAT ACG CTT CTC AGT TTA CTG TTG ATG GGA CGA CGC AAA TAT GAT GCT CTA GTC 3529 

Tyr Thr Leu Leu Ser Leu Leu Leu Met Gly Arg Arg Lys Tyr Asp Ala Leu Vat 859 ’ 

GCA TGG TTT GTC TTG GGC AGA GCA TGT GGG AGA CCA ATT TAT TTA CGT GAA TAT 3583' 
Ala Trp Phe Val Leu Gly Arg Ala Cys Gly Arg Pro XU Tyr Leu Arg Glu Tyr 877 

GCC AAC TGC TCT ACT AAT GAA CCA TTT GGA ACT TGT AAA TTA AAG TCC CTA GGA 3637 

Ala Asn Cys Ser Thr Asn Glu Pro Phe Gly Thr Cys Lys Leu Lys Ser Leu Gly 895 

TGG TGG GAT AGA AGA TAT GCA ATG ACG AGT TAT ATC GAT CGA GAT GAA TTG AAA 3691 

Trp Trp Asp Arg Arg Tyr Ala Met Thr Ser Tyr He Asp Arg Asp 61u Leu Lys 913 

TTG ATT ATT GCA GCA CCC AGT CGT GAG CTA AGT GGA TTA TAT ACG CGT TTA ATA 3745 

Leu 11a He Ala Ala Pro Sar Arg Glu Leu Ser Gly Leu Tyr Thr Arg Leu He 931 

ATA ATT AAT GGA GAA CCC ATT TCG AGT GAC ATA TTA CTG ACT GTT AAA GAA ACA 3799 

He lie Asn Gly Glu Pro He Ser Ser Asp He Leu Leu Thr Va 1 Lys Glu Thr 949 

TGT AGT TTT TCG AGA CGG GGG ATA AAG GAT AAC AAA CTA TGC AAA CCG TTC AGT 3853 

Cys Ser Phe Ser Arg Arg Gly He Lys Asp Asn Lys Leu Cys Lys Pro Phe Ser 967 

TTT TTT GTC AAT GGT ACA ACA CGG CTG TTA GAC ATG GTG GGA ACA GGA ACC CCG 3907 

Phe Phe Val Asn Gly Thr Thr Arg Leu Leu Asp Mat Val Gly Thr Gly Thr Pro 985 

AGA GCT CAT GAA GAA AAT GIG AAG CAG TGG CTT GAA CGA ATT GGT GGT AAA CAT 3961 





1003 


Ar ft 

Ala 

His 

Glu 

Glu 

Asn 

Val 

Lvs 

Gin 

Trp 

Leu 

Glu 

Arg 

lit 

G1 y 

GW 

Lvs 

His 

CTA 

CCA 

ATC 

GTC 

GTC 

GAA 

ACA 

TCT 

ATG 

CAA 

CAA 

GTC 

TCA 

AAT 

TTG 

CCG 

AGA 

AGT 

Leu 

Pro 

XU 

V al 

Val 

Glu 

Thr 

Ser 

Met 

Gin 

Gin 

Val 

Ser 

Asn 

Leu 

Pro 

Arg 

Ser 

TTT 

AG A 

GAT 

TCA 

TAT 

TTC 

AAA 

TCA 

CCT 

GAC 

GAC 

GAT 

AAA 

TAT 

GAT 

GAC 

GTC 

AAA 

Phft 

Arft 

Asp 

$«r 

Tyr 

Ph« 

Lys 

Ser 

Pro 

Asp 

Asp 

Asp 

Lys 

Tyr 

Asp 

Asp 

Val 

LyS 

ATG 

ACA 

TCG 

GCC 

ACT 

ACT 

AAT 

AAC 

ATT 

ACC 

ACC 

TCC 

GTG 

GAT 

GGT 

TAC 

ACT 

GGA 

rut 

Thr 

S*r 

Ala 

Thr 

Thr 

Asn 

Asn 

lit 

Thr 

Thr 

Sar 

Val 

asp 

Gly 

Tyr 

Thr 

G 1 y 

CTC 

ACT 

AAT 

CGG 

CCC 

GAG 

GAC 

TTT 

GAG 

AAA 

GCA 

CCA 

TAC 

ATA 

ACT 

AAA 

CGA 

CCG 

Lau 

Thr 

Asn 

Arft 

Pro 

Glu 

Asp 

Phe 

Glu 

Lys 

Al a 

Pro 

Tyr 

lift 

Thr 

Lys 

Arg 

Pro 

ATA 

ATC 

TCT 

GTC 

GAG 

GAG 

GCA 

TCC 

AGT 

CAA 

TCA 

CCT 

AAA 

ATA 

TCA 

ACA 

GAA 

AAA 

XU 

lift 

Ser 

Vat 

GU 

G1 u 

Ala 

Ser 

Ser 

Gin 

Ser 

Pro 

Lys 

lift 

Ser 

Thr 

Glu 

Lys 

AAA 

TCC 

CGA 

ACG 

CAA 

ATA 

ATA 

ATT 

TCA 

CTA 

GTT 

GTT 

CTA 

TGC 

6 TC 

ATG 

TTT 

TGT 

Lys 

Ser 

Arft 

Thr 

Gin 

lift 

lie 

He 

Ser 

Leu 

Val 

val 

Leu 

Cys 

Val 

rut 

Phe 

Cys 

TTC 

ATT 

GTA 

ATC 

GGG 

TCT 

GGT 

ATA 

TGG 

ATC 

CTT 

CGC 

AAA 

CAC 

CGC 

AAA 

ACG 

GTG 

Phe 

lie 

Vat 

lie 

Gly 

Ser 

Gly 

lie 

Trp 

lift 

Leu 

Ar 9 

Lvs 

His 

Arg 

Lys 

Thr 

Val 

ATG 

tat 

GAT 

AGA 

CGT 

CGT 

CCA 

TCA 

AGA 

CGG 

GCA 

TAT 

TCC 

CGC 

CTA 

TAACACGTGTT 

rift t 

Tyr 

Asp 

Arg 

Arg 

Arg 

Pro 

Sar 

Arg 

Arg 

Ala 

Tyr 

Sar 

Arg 

Lftu 





TGGTATGGGCGTGTCGCTATAGTGCATAAGAAGTTGACTACATTGCATCAATGACATTATATAGCTTCTTTG 


gtcagatagacggcgtgtgtgattgcgatg tat cta cta caa TTA TTA ttt tgg atc cgc 

rut Tyr Leu Leu Gin Leu Leu Phft Trp lift Arg 


CTC 

TTT 

CGA 

GGC 

ATC 

TGG 

TCT 

ATA 

GTT 

TAT 

ACT 

GGA 

ACA 

TCT 

GTT 

ACG 

TTA 

TCA 

Leu 

Phe 

Arg 

Gly 

lift 

Trp 

Sftr 

lift 

Val 

T yr 

Thr 

61 y 

Thr 

Ser 

Val 

Thr 

Ltu 

Ser 

ACG 

GAC 

CAA 

TCT 

GCT 

CTT 

GTT 

GCG 

TTC 

TGC 

GGA 

TTA 

GAT 

AAA 

ATG 

GTG 

AAT 

GTA 

Thr 

Asp 

Gin 

Ser 

Ala 

Leu 

Val 

Ala 

Phe 

Cys 

Gly 

L«u 

Asp 

Lys 

Mat 

Val 

Asn 

Val 

CGC 

GGC 

CAA 

CTT 

TTA 

TTC 

CTG 

GGC 

GAC 

CAG 

ACT 

CGG 

ACC 

AGT 

TCT 

TAT 

ACA 

GGA 

Arg 

Gly 

Gin 

Leu 

Leu 

Phft 

Lftu 

Gly 

Asp 

Gin 

Thr 

Arg 

Thr 

S«r 

Ser 

Tyr 

Thr 

Gly 

ACG 

ACG 

GAA 

ATC 

TTG 

AAA 

TGG 

GAT 

GAA 

GAA 

TAT 

AAA 

TGC 

TAT 

TCC 

GTT 

CTA 

CAT 

Thr 

Thr 

Glu 

lift 

L«u 

Lys 

Trp 

Asp 

Glu 

Glu 

Tyr 

Ly*s 

Cys 

Tyr 

Ser 

Val 

L«u 

His 

GCG 

ACA 

TCA 

TAT 

ATG 

GAT 

TGT 

CCT 

GCT 

ATA 

GAC 

GCC 

ACG 

GTA 

TTC 

* 

AGA 

GGC 

TGT 

Ala 

Thr 

Sftr 

Tyr 

n«t 

Asp 

Cys 

Pro 

Ala 

lift 

Asp 

Ala 

Thr 

Val 

Phft 

Arg 

Gly 

Cvs 

AGA 

GAC 

GCT 

GTG 

GTA 

TAT 

GCT 

CAA 

CCT 

CAT 

GAT 

AGA 

GTA 

CAA 

CCT 

TTT 

CCC 

GAA 

Arg 

Asp 

Ala 

Val 

Val 

Tyr 

Ala 

Gin 

Pro 

His 

Asp 

Arg 

Val 

Gin 

Pro 

Phft 

Pro 

Glu 

AAG 

GGA 

ACA 

TTG 

TTG 

AGA 

ATT 

GTC 

GAA 

CCC 

AGA 

GTA 

TCA 

GAT 

ACA 

GGC 

AGC 

TAT 

Lys 

Gly 

Thr 

Leu 

Lftu 

Arg 

lift 

Val 

Glu 

Pro 

Arg 

Val 

Sftr 

Asp 

Thr 

Gly 

Sar 

Tyr 

TAC 

ATA 

CGT 

GTA 

GCT 

CTC 

GCT 

GGA 

AGA 

AAT 

ATG 

AGC 

GAT 

ATA 

TTT 

AGA 

ATG 

GCT 

Tyr 

XU 

Arg 

Val 

Ala 

Lftu 

Ala 

Gly 

Arg 

Asn 

net 

$«r 

Asp 

lift 

Phft 

Arg 

n«t 

AU 

GTT 

ATT 

ATA 

AGG 

AGT 

AGC 

AAA 

TCT 

TGG 

GCN 

TGT 

AAT 

CAC 

TCT 

GCT 

AGT 

TCA 

TTT 

Val 

lift 

lie 

Arg 

Ser 

Ser 

Lys 

S«r 

Trp 

aa 

Cys 

Asn 

His 

Sftr 

Ala 

Sftr 

Sftr 

Phft 

CAG 

GCC 

CAT 

AAA 

TGT 

ATT 

CGC 

TAT 

GTC 

GAC 

CGT 

ATG 

GCC 

TTT 

GAA 

AAT 

TAT 

CTG 

Gin 

All 

HI S 

Lys 

Cys 

1U 

Arg 

Tyr 

val 

ASP 

Arg 

n«t 

Ala 

Phe 

Glu 

Asn 

Tyr 

Lftu 

ATT 

GGA 

CAT 

GTA 

GGC 

AAT 

TTG 

CTG 

GAC 

AGT 

GAC 

TCG 

GAA 

TTG 

CAT 

GCA 

ATT 

TAT 

1U 

Gly 

HiS 

Val 

Gly 

Asn 

Leu 

Lftu 

Asp 

Ser 

Asp 

Ser 

Glu 

Lau 

His 

Ala 

1U 

Tyr 

AAT 

ATT 

ACT 

CCC 

CAA 

TCC 

ATT 

TCC 

ACA 

GAT 

ATT 

AAT 

ATT 

ATA 

ACG 

ACT 

CCA 

TTT 

Asn 

lift 

Thr 

Pro 

Gin 

S«r 

lift 

S«r 

Thr 

Asp 

XU 

Asn 

lift 

lie 

Thr 

Thr 

Pro 

Phe 


4015 

1021 

406 ? 

1039 

4123 

1057 

4177 

1075 

423 \ X 
1093 

4265 

ini 

4339 

1129 

4395 

1144 

4467 

4527 

1155 

4551 

1173 

4635 

1191 

4669 

1209 

4743 

1227 

4797 

1245 

4651 

1263 

4905 

1261 

4959 

1299 

5013 

1317 

5067 

1335 

5121 

1353 

5175 

1371 



. TAC GAT AAT TCG GGA ACA ATT TAT TCA CCT ACG GTT TTT AAT TTG TTT AAT AAC 
Tyr Asp Asn S«r Gly Thr He Tyr Set Pro Thr Val Phe Asn L««j Ph* Asn Asn 


5229 

1339 


. AAT TCC CAT GTC GAT GCA ATG AAT TC 
Asn Ser His Val Asp Ala net Asn 


3255 

1397 
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26361-74 


SEQ ID no. 1 B 
Length of sequence: 

Type of molecule sequenced: 

Origin of the molecule: 

> 

Experimental source: 


1/188 pairs of bases 
genomic DNA 

Marek's disease virus, strain 
RBI B 

pMDV 05 plasmid 


Characteristics: 


from 

from 


1 to 324 pairs of bases: non-coding region 
325 to 1,135 pairs of bases: US1 gene: the gene is 

transcribed from right to left; the 
indicated sequence is complementary to 
the sequence SEQ ID no. 1 
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'3 AG AT CAGACCG6CCCAGTTATTAACAATAAAAAASATT 


Met 

IU 

Thr 

!U 

V 3 1 

Thr 

Leu 

Leu 

Asp 

Glu 

AT6 

ATA 

ACT 

ATA 

GTC 

ACA 

CTT 

CTA 

3 AT 

GAA 

Arp 

Asp 

Ala 

Ala 

Ser 

Thr 

Leu 

Trp 

lift 

Phe 

AGA 

GAT 

GCT 

GCA 

TCT 

ACT 

TTA 

TGG 

AT a 

TTC 

lie 

G1 n 

Asp 

Asp 

Val 

GW 

Val 

Pro 

He 

He 

ATA 

CAG 

GAT 

GAT 

GTG 

GGT 

GTG 

CCC 

ATA 

ATC 

Phe 

A1 a 

Lys 

Pro 

Met 

Leu 

lie 

Leu 

Pro 

Arg 

TTT 

GCC 

AAA 

ccc 

ATG 

TTA 

ATT 

CTT 

CCT 

CGG 

Lys 

Pro 

Pro 

Asp 

GW 

Thr 

GW 

Val 

Ar g 

G ly 

AAG 

CCA 

CCA 

GAT 

GGA 

ACT 

GGA 

GTT 

CGT 

GGT 

Ser 

Phe 

XU 

Val 

Arg 

Leu 

Phe 

Glu 

Asp 

Val 

TCG 

TTT. 

ATA 

GTG 

CGG 

CTA 

TTT 

GAA 

GAT 

GTT 

Asp 

Val 

Leu 

Ser 

61 v 

Tyr 

Leu 

Met 

Leu 

Glu 

GAT 

GTT 

CTA 

TCT 

GGA 

TAT 

TTG 

ATG 

TTG 

GAA 

HU 

Ser 

Leu 

Trp 

XI e 

Val 

3 ly 

Ala 

Ala 

Asp 

CAT 

AGO 

TT5 

TGG 

ATA 

GTT 

GGG 

GCG 

GCA 

GAT 

lie 

Pro 

Leu 

Pro 

Lys 

Arg 

Leu 

Leu 

A) a 

He 

ATT 

CCT 

TTG 

CCA 

AAA 

AGG 

TTA 

CTT 

GCA 

ATC 

Met 

Pro 

Trp 

A1 a 

11 e 

Pro 

As> 

Asn 

He 

G1 r» 

ATG 

CCG 

TGG 

GCC 

ATT 

CCC 

GAC 

AAT 

ATT 

CAA 

Pro 

Lys 

Phe 

Asp 

Thr 

Pro 

Glu 

Asp 

Arg 

A1 a 

CCG 

AAG 

TTC 

GAC 

ACC 

CCA 

GAA 

GAT 

AGA 

GCG 

Cys 

Val 

Tyr 

Lys 

lie 

Leu 

GW 

Ser 

Pro 

Pro 

TGT 

GTA 

TAC 

AAA 

ATC 

CTC 

GGG 

TCC 

CCA 

CCC 

Glu 

Pro 

Pro 

Glr. 

Met 

Ser 

Ser 

Thr 

Pro 

GW 

GAA 

CCA 

CCT 

CAA 

ATG 

AGT 

AGT 

ACA 

CCC 

GGC 

Cys 

Ly s 

Lys 

Glu 

* 

Asp 

Arg 

Asp 

AU 

1U 

Ala 

TGC 

AAG 

AAA 

GAA 

GAT 

AGA 

GAT 

GCG 

ATT 

GCA 

GW 

Lys 

Ser 

Are 

He 

Gin 

Lys 

Lvs 

Cys 

Arg 

GGA 

AAG 

TCA 

CGA 

ATA 

CAG 

AAA 

AAA 

TGT 

AGA 


Met GW Val Ser 

TTG6TG6AGTSAAG.AT6 GGT GTG TCC 1189 


Cys 

Asp 

Arg 

Leu 

Pro 

Gly 

Ar g 

Ser 


TGC 

GAT 

CGA 

TTG 

CCA 

GGA 

AGA 

TCT 

1113 

Leu 

He 

Lys 

61 ft 

Cys 

Met 

61 u 

61 h 


CTT 

ATA 

AAG 

CAA 

TGT 

ATG 

GAA 

CAA 

1069 

Val 

Arg 

Ala 

AU 

ASP 

Leu 

Phe 

Arg 


GTC 

AGA 

GCT 

GCA 

GAC 

CTA 

TTC 

CGT 

1015* 

Gin 

His 

Arg 

Pro 

lie 

Val 

Arg 

Thr 


CAA 

CAT 

CGA 

CCG 

ATA 

GTA 

AGG 

ACA 

961 

Thr 

61 y 

Leu 

AU 

GW 

Thr 

Arg 

Asp 


ACC 

GGA 

TTG 

GCC 

GGA 

ACT 

AGG 

GAT 

907 

AU 

61 y 

Cys 

Ser 

Thr 

Glu 

Trp 

Gin 


GCA 

GGA 

TGT 

TCC 

ACA 

GAA 

TGG 

CAG 

8S3 

Ser 

Glu 

Val 

Ser 

Gly 

Asn 

Al a 

Pro 


TCT 

GAA 

GTT 

TCT 

GGT 

AAT 

GCT 

CCA 

799 

He 

Cys 

Arg 

He 

A1 a 

Leu 

Glu 

Cys 


ATA 

TGT 

CGC 

ATT 

GCG 

CTC 

GAA 

TGT 

74 S 

Lys 

Val 

Ser 

GW 

Thr 

Trp 

Ser 

■31 y 


AAA 

GTG 

TCT 

GGG 

ACC 

TGG 

TCC 

GGT 

691 

Thr 

Leu 

Leu 

Thr 

Ser 

Thr 

Trp 

61 u 

637 

ACT 

CTC 

TTG 

ACA 

TCT 

ACA 

TGG 

GAA 

Hi S 

Phe 

Cys 

ASP 

Ser 

Asp 

Met 

Val 

S83 

CAT 

TTT 

TGC 

GAC 

AGT 

GAT 

ATG 

GTA 

Ash 

Pr o 

Leu 

Ly i 

Pro 

Pro 

Glu 

lie 

S29 

AAT 

CCT 

CTA 

AAA 

CCT 

CCG 

GAA 

ATC 

Arg 

Leu 

Phe 

Cys 

Cys 

GW 

Lvs 

Cys 

47 S 

AGA 

TTA 

TTC 

TGT 

TGT 

GGA 

AAA 

TGT 

lie 

Pro 

Val 

Arg 

Tyr 

Thr 

Al a 

Thr 

367 . 

ATT 

CCG 

GTT 

CGT 

TAC 

ACT 

GCG 

ACA 


Ala Gly Ser Hit 

GCC GGT AGT CAT TAGCTGTTATTCGAC 310 


AGACCTACTTGCTACCAATTAGATATAATTACATGATGGGGC5TATACACATTACGATTAGST6CATCGCTA 2S» , 


CAACCGTCGCTATAGTGTCACGTATAATTTGTATATTACTGCAATAACAAACCCrTCTAGArCACTTATGTA 166 


TCCAGGCTATCT7CCATATACTTCTAACATCAGGAGAGATTCAACAATCGAGCGCATTTGAAA6ACAACGAT 94 


GAGCAGAGTCAATGCTACAATGTTC6ATGATATGGATATACCAAGAGGACGATTTGGTAAGCCACCGAGAAA 22 


GATTACTAATGTAAATTTTTG 
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SEQ ID no. 2 

Type of sequence: 

Length of sequence: 

Type of molecule: 

. 5 * GGA CTC GAA CCA TGG 

\ _I 

NCO 1 


oligonucleotide 


DNA 

AGT CTA TTA CC 3’ 


SEQ ID no. 3 

Type of sequence: oligonucleotide 

Length of sequence: 

Type of molecule: DNA 

5 1 GAC GGG TGT CGA CAT ATG AG 3 1 

i_I 

Sail 

SEQ ID no, 4 

Type of sequence: oligonucleotide 

Length of sequence: 

Type of molecule: DNA 

5' CTG GAG CCC ATG GTG CAC CTT TG 3' 

I_I 

NC01 

SEQ ID no. 5 

Type of sequence: oligonucleotide 

Length of sequence: 

Type of molecule: DNA 

5 * CAA ATT GCT ATT GTC GAC ACC TCC GCC TCT C 3' 


Sail 



24 


26361-74 


THE EMBODIMENTS OF THE INVENTION IN WHICH AN EXCLUSIVE 
PROPERTY OR PRIVILEGE IS CLAIMED ARE DEFINED AS FOLLOWS: 

!• Nucleotide sequence, and its variants, from the unique 

short domain (Us) of the virus of Marek's disease, characterized 
in that it corresponds to the US3 gene, is non-essential to 
replication, and is homologous with the kinase protein gene of the 
herpes simplex virus, 

2* Nucleotide sequence US3 of the MDV virus, and his 

variants, according to claim 1 appearing on the sequence ID 
no. 1. 


3. Nucleotide sequence ID no. 1 and genes and their 
variants including this sequence. 

4. Recombinant virus selected from the herpes virus, 
comprising at least one heterologous gene inserted into the Us 

region of the genome of said virus corresponding to the gene of 
the kinase protein. 

3* Recombinant virus according to claim 4, characterized in 

that the gene is a coding gene for a viral, bacterial or parasitic 
immunogene. 


6- Recombinant virus of Marek's disease according to claim 

5, comprising at least one heterologous gene, characterized in 
that this gene is inserted into the region of its genome 
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corresponding to the US3 gene, in such a way so as to be able to 
be expressed* 

7. Recombinant virus according to claim 6, characterized i 
that the inserted heterologous gene codes for a pathogen selected 
from the group consisting of infectious avian bronchitis, 
Newcastle's disease, Gumboro's disease, fowl plague, chicken 
anaemia, egg-drop syndrome, fowl pox, infectious laryngo- 
tracheitis, avian coli bacillosis, pasteurelosis, coccidiosis, 
haemophilosis. 

8. Recombinant virus according to one of claims 6 and 7, 
characterized in that this is the MDV virus, 

9. Recombinant virus according to one of claims 6 and 7, 
characterized in that this is the HVT virus. 

10. Recombinant virus according to claim 9, characterized 
in that the inserted gene codes for an immunogene of a Marek's 
disease virus of serotype 1 or 2. 

11. Recombinant virus according to any one of claims 4 to 
10, characterized in that the gene is inserted in order to be 
expressed under the control of the transcription regulating 
sequences of the gene of the kinase protein. 
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12. Recombinant virus according to one of claims 4 to 11, 
charact rized in that the heterologous gene inserted is likely to 
be expressed under the control of promoting sequences of the virus 
in question, or other herpes viruses. 

13. Recombinant virus according to any one of claims 4 to 
12, characterized in that the start and stop codons of the gene 
inserted are substituted for those of the US3 gene. 

14. Vaccine characterized in that it comprises a recombinant 
virus according to any one of claims 3 to 9. 

15. Process for preparation of a recombinant virus of 
Marek's disease, characterized in that at least one heterologous 
gene is inserted into the region of its genome corresponding to 
the US3 gene, m such a way so as to be able to be expressed. 
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